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(1) GENERAL INFORMATION: 

(i) APPLICANT: Ni et al . 

(ii) TITLE OF INVENTION: PANCREAS -DERIVED PLASMINOGEN ACTIVATOR 
INHIBITOR 

(iii) NUMBER OF SEQUENCES: 15 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: STERNE, KESSLER, GOLDSTEIN & FOX P.L.L.C. 

(B) STREET: 1100 NEW YORK AVENUE, SUITE 600 

(C) CITY: WASHINGTON 

(D) STATE: DC 

(E) COUNTRY: USA 

(F) ZIP: 20005-3934 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US To be assigned 

(B) FILING DATE: Herewith 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/934,011 

(B) FILING DATE: 15 -AUG- 19 97 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/024,056 

(B) FILING DATE: 16-AUG-1996 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: STEFFE, ERIC K. 

(B) REGISTRATION NUMBER: 36,688 

(C) REFERENCE/DOCKET NUMBER: 1488.0300002 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-371-2600 

(B) TELEFAX: 202-371-2540 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1371 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 
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(B) LOCATION: 67 . . 1242 



FEATURE : 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 109.. 1242 



FEATURE : 

(A) NAME /KEY : 

(B) LOCATION: 



sig_peptide 
67 . .108 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

GGCACGAGGG AAAACTCTAT TTTGAAAATG AATATATTTT GATTTAAACA ATACAGAGAA 60 

GTCAAA ATG GAC AC A ATC TTC TTG TGG AGT CTT CTA TTG CTG TTT TTT 108 
Met Asp Thr lie Phe Leu Trp Ser Leu Leu Leu Leu Phe Phe 
-14 -10 -5 

GGA AGT CAA GCC TCA AGA TGC TCA GCT CAA AAA AAT ACC GAA TTT GCA 156 
Gly Ser Gin Ala Ser Arg Cys Ser Ala Gin Lys Asn Thr Glu Phe Ala 
15 10 15 

GTG GAT CTT TAT CAA GAG GTT TCC TTA TCT CAT AAG GAC AAC ATT ATA 2 04 

Val Asp Leu Tyr Gin Glu Val Ser Leu Ser His Lys Asp Asn lie lie 
20 25 30 

TTT TCA CCC CTT GGA ATA ACT TTG GTT CTT GAG ATG GTA CAA CTG GGA 2 52 

Phe Ser Pro Leu Gly lie Thr Leu Val Leu Glu Met Val Gin Leu Gly 
35 40 45 

GCC AAA GGA AAA GCA CAG CAG CAG ATA AGA CAA ACT TTA AAA CAA CAG 300 
Ala Lys Gly Lys Ala Gin Gin Gin lie Arg Gin Thr Leu Lys Gin Gin 
50 55 60 

GAA ACC TCA GCT GGG GAA GAA TTT TTG GTA CTG AAG TCA TTT TGC TCT 348 
Glu Thr Ser Ala Gly Glu Glu Phe Leu Val Leu Lys Ser Phe Cys Ser 
65 70 75 80 

GCC ATC TCA GAG AAA AAA CAA GAA TTT AC A TTT AAT CTT GCC AAT GCC 396 
Ala lie Ser Glu Lys Lys Gin Glu Phe Thr Phe Asn Leu Ala Asn Ala 
85 90 95 

CTC TAC CTT CAA GAA GGA TTC ACT GTG AAA GAA CAG TAT CTC CAT GGC 444 
Leu Tyr Leu Gin Glu Gly Phe Thr Val Lys Glu Gin Tyr Leu His Gly 
100 105 110 

AAC AAG GAA TTT TTT CAG AGT GCT ATA AAA CTG GTG GAT TTT CAA GAT 492 
Asn Lys Glu Phe Phe Gin Ser Ala lie Lys Leu Val Asp Phe Gin Asp 
115 120 125 

GCA AAG GCT TGT GCA GAG ATG ATA AGT ACC TGG GTA GAA AGA AAA AC A 540 
Ala Lys Ala Cys Ala Glu Met lie Ser Thr Trp Val Glu Arg Lys Thr 
130 135 140 

GAT GGA AAA ATT AAA GAC ATG TTT TCA GGG GAA GAA TTT GGC CCT CTG 588 
Asp Gly Lys lie Lys Asp Met Phe Ser Gly Glu Glu Phe Gly Pro Leu 
145 150 155 160 

ACT CGG CTT GTC CTG GTG AAT GCT ATT TAT TTC AAA GGA GAT TGG AAA 63 6 

Thr Arg Leu Val Leu Val Asn Ala lie Tyr Phe Lys Gly Asp Trp Lys 
165 170 175 
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CAG AAA TTC AGA AAA GAG GAC AC A CAG CTG ATA AAT TTT ACT AAG AAA 684 
Gin Lys Phe Arg Lys Glu Asp Thr Gin Leu lie Asn Phe Thr Lys Lys 
180 185 190 

AAT GGT TCA ACT GTC AAA ATT CCA ATG ATG AAG GCT CTT CTG AGA ACA 732 
Asn Gly Ser Thr Val Lys lie Pro Met Met Lys Ala Leu Leu Arg Thr 
195 200 205 

AAA TAT GGT TAT TTT TCT GAA TCT TCC CTG AAC TAC CAA GTT TTA GAA 7 80 

Lys Tyr Gly Tyr Phe Ser Glu Ser Ser Leu Asn Tyr Gin Val Leu Glu 
210 215 220 

TTG TCT TAC AAA GGT GAT GAA TTT AGC TTA ATT ATC ATA CTT CCT GCA 828 
Leu Ser Tyr Lys Gly Asp Glu Phe Ser Leu lie lie lie Leu Pro Ala 
225 230 235 240 

GAA GGT ATG GAT ATA GAA GAA GTG GAA AAA CTA ATT ACT GCT CAA CAA 87 6 

Glu Gly Met Asp lie Glu Glu Val Glu Lys Leu lie Thr Ala Gin Gin 
245 250 255 

ATC CTA AAA TGG CTC TCT GAG ATG CAA GAA GAG GAA GTA GAA ATA AGC 924 
lie Leu Lys Trp Leu Ser Glu Met Gin Glu Glu Glu Val Glu lie Ser 
260 265 270 

CTC CCT AGA TTT AAA GTA GAA CAA AAA GTA GAC TTC AAA GAC GTT TTG 972 
Leu Pro Arg Phe Lys Val Glu Gin Lys Val Asp Phe Lys Asp Val Leu 
275 280 285 

TAT TCT TTG AAC ATA ACC GAG ATA TTT AGT GGT GGC TGC GAC CTT TCT 102 0 

Tyr Ser Leu Asn lie Thr Glu lie Phe Ser Gly Gly Cys Asp Leu Ser 
290 295 300 

GGA ATA ACA GAT TCA TCT GAA GTG TAT GTT TCC CAA GTG ACG CAA AAA 1068 
Gly lie Thr Asp Ser Ser Glu Val Tyr Val Ser Gin Val Thr Gin Lys 
305 310 315 320 

GTT TTC TTT GAG ATA AAT GAA GAT GGT AGT GAA GCT GCA ACA TCA ACT 1116 
Val Phe Phe Glu lie Asn Glu Asp Gly Ser Glu Ala Ala Thr Ser Thr 
325 330 335 

GGC ATA CAC ATC CCT GTG ATC ATG AGT CTG GCT CAA AGC CAA TTT ATA 1164 
Gly lie His lie Pro Val lie Met Ser Leu Ala Gin Ser Gin Phe lie 
340 345 350 

GCA AAT CAT CCA TTT CTG TTT ATT ATG AAG CAT AAT CCA ACA GAA TCA 1212 
Ala Asn His Pro Phe Leu Phe lie Met Lys His Asn Pro Thr Glu Ser 
355 360 365 

ATT CTG TTT ATG GGA AGA GTG ACA AAT CCC TGACACCCAG GAGATAAAAG 12 62 

lie Leu Phe Met Gly Arg Val Thr Asn Pro 
370 375 

GAAGAGATTT AGATTCACTG TGAATGAAAA GCACAGCCTC AGAATAAAAG ATGATTTCTC 1322 

AAAAATAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 1371 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 392 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Asp Thr lie Phe Leu Trp Ser Leu Leu Leu Leu Phe Phe Gly Ser 
-14 -10 -5 1 

Gin Ala Ser Arg Cys Ser Ala Gin Lys Asn Thr Glu Phe Ala Val Asp 
5 10 15 

Leu Tyr Gin Glu Val Ser Leu Ser His Lys Asp Asn lie lie Phe Ser 
20 25 30 

Pro Leu Gly lie Thr Leu Val Leu Glu Met Val Gin Leu Gly Ala Lys 
35 40 45 50 

Gly Lys Ala Gin Gin Gin lie Arg Gin Thr Leu Lys Gin Gin Glu Thr 
55 60 65 

Ser Ala Gly Glu Glu Phe Leu Val Leu Lys Ser Phe Cys Ser Ala lie 
70 75 80 

Ser Glu Lys Lys Gin Glu Phe Thr Phe Asn Leu Ala Asn Ala Leu Tyr 
85 90 95 

Leu Gin Glu Gly Phe Thr Val Lys Glu Gin Tyr Leu His Gly Asn Lys 
100 105 110 

Glu Phe Phe Gin Ser Ala lie Lys Leu Val Asp Phe Gin Asp Ala Lys 
115 120 125 130 

Ala Cys Ala Glu Met lie Ser Thr Trp Val Glu Arg Lys Thr Asp Gly 
135 140 145 

Lys lie Lys Asp Met Phe Ser Gly Glu Glu Phe Gly Pro Leu Thr Arg 
150 155 160 

Leu Val Leu Val Asn Ala lie Tyr Phe Lys Gly Asp Trp Lys Gin Lys 
165 170 175 

Phe Arg Lys Glu Asp Thr Gin Leu lie Asn Phe Thr Lys Lys Asn Gly 
180 185 190 

Ser Thr Val Lys lie Pro Met Met Lys Ala Leu Leu Arg Thr Lys Tyr 
195 200 205 210 

Gly Tyr Phe Ser Glu Ser Ser Leu Asn Tyr Gin Val Leu Glu Leu Ser 
215 220 225 

Tyr Lys Gly Asp Glu Phe Ser Leu lie lie lie Leu Pro Ala Glu Gly 
230 235 240 

Met Asp lie Glu Glu Val Glu Lys Leu lie Thr Ala Gin Gin lie Leu 
245 250 255 

Lys Trp Leu Ser Glu Met Gin Glu Glu Glu Val Glu lie Ser Leu Pro 
260 265 270 

Arg Phe Lys Val Glu Gin Lys Val Asp Phe Lys Asp Val Leu Tyr Ser 
275 280 285 290 



Leu Asn lie Thr Glu lie Phe Ser 
295 

Thr Asp Ser Ser Glu Val Tyr Val 
310 

Phe Glu lie Asn Glu Asp Gly Ser 
325 330 

His lie Pro Val lie Met Ser Leu 
340 345 

His Pro Phe Leu Phe lie Met Lys 
355 360 

Phe Met Gly Arg Val Thr Asn Pro 
375 



Gly Gly Cys Asp Leu Ser Gly He 
300 305 

Ser Gin Val Thr Gin Lys Val Phe 
315 320 

Glu Ala Ala Thr Ser Thr Gly He 
335 

Ala Gin Ser Gin Phe He Ala Asn 
350 

His Asn Pro Thr Glu Ser He Leu 
365 370 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 402 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Met Gin Met Ser Pro Ala Leu Thr Cys Leu Val Leu Gly Leu Ala Leu 
15 10 15 

Val Phe Gly Glu Gly Ser Ala Val His His Pro Pro Ser Tyr Val Ala 
20 25 30 

His Leu Ala Ser Asp Phe Gly Val Arg Val Phe Gin Gin Val Ala Gin 
35 40 45 

Ala Ser Lys Asp Arg Asn Val Val Phe Ser Pro Tyr Gly Val Ala Ser 
50 55 60 

Val Leu Ala Met Leu Gin Leu Thr Thr Gly Gly Glu Thr Gin Gin Gin 
65 70 75 80 

He Gin Ala Ala Met Gly Phe Lys He Asp Asp Lys Gly Met Ala Pro 
85 90 95 

Ala Leu Arg His Leu Tyr Lys Glu Leu Met Gly Pro Trp Asn Lys Asp 
100 105 110 

Glu He Ser Thr Thr Asp Ala lie Phe Val Gin Arg Asp Leu Lys Leu 
115 120 125 

Val Gin Gly Phe Met Pro His Phe Phe Arg Leu Phe Arg Ser Thr Val 
130 135 140 

Lys Gin Val Asp Phe Ser Glu Val Glu Arg Ala Arg Phe He He Asn 



145 150 155 160 

Asp Trp Val Lys Thr His Thr Lys Gly Met lie Ser Asn Leu Leu Gly 
165 170 175 

Lys Gly Ala Val Asp Gin Leu Thr Arg Leu Val Leu Val Asn Ala Leu 
180 185 190 

Tyr Phe Asn Gly Gin Trp Lys Thr Pro Phe Pro Asp Ser Ser Thr His 
195 200 205 

Arg Arg Leu Phe His Lys Ser Asp Gly Ser Thr Val Ser Val Pro Met 
210 215 220 

Met Ala Gin Thr Asn Lys Phe Asn Tyr Thr Glu Phe Thr Thr Pro Asp 
225 230 235 240 

Gly His Tyr Tyr Asp lie Leu Glu Leu Pro Tyr His Gly Asp Thr Leu 
245 250 255 

Ser Met Phe lie Ala Ala Pro Tyr Glu Lys Glu Val Pro Leu Ser Ala 
260 265 270 

Leu Thr Asn lie Leu Ser Ala Gin Leu lie Ser His Trp Lys Gly Asn 
275 280 285 

Met Thr Arg Leu Pro Arg Leu Leu Val Leu Pro Lys Phe Ser Leu Glu 
290 295 300 

Thr Glu Val Asp Leu Arg Lys Pro Leu Glu Asn Leu Gly Met Thr Asp 
305 310 315 320 

Met Phe Arg Gin Phe Gin Ala Asp Phe Thr Ser Leu Ser Asp Gin Glu 
325 330 335 

Pro Leu His Val Ala Gin Ala Leu Gin Lys Val Lys lie Glu Val Asn 
340 345 350 

Glu Ser Gly Thr Val Ala Ser Ser Ser Thr Ala Val lie Val Ser Ala 
355 360 365 

Arg Met Ala Pro Glu Glu lie lie Met Asp Arg Pro Phe Leu Phe Val 
370 375 380 

Val Arg His Asn Pro Thr Gly Thr Val Leu Phe Met Gly Gin Val Met 
385 390 395 400 

Glu Pro 



INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 415 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Glu Asp Leu Cys Val Ala Asn Thr Leu Phe Ala Leu Asn Leu Phe 
15 10 15 

Lys His Leu Ala Lys Ala Ser Pro Thr Gin Asn Leu Phe Leu Ser Pro 
20 25 30 

Trp Ser lie Ser Ser Thr Met Ala Met Val Tyr Met Gly Ser Arg Gly 
35 40 45 

Ser Thr Glu Asp Gin Met Ala Lys Val Leu Gin Phe Asn Glu Val Gly 
50 55 60 

Ala Asn Ala Val Thr Pro Met Thr Pro Glu Asn Phe Thr Ser Cys Gly 
65 70 75 " 80 

Phe Met Gin Gin lie Gin Lys Gly Ser Tyr Pro Asp Ala lie Leu Gin 
85 90 95 

Ala Gin Ala Ala Asp Lys lie His Ser Ser Phe Arg Ser Leu Ser Ser 
100 105 110 

Ala lie Asn Ala Ser Thr Gly Asp Tyr Leu Leu Glu Ser Val Asn Lys 
115 120 125 

Leu Phe Gly Glu Lys Ser Ala Ser Phe Arg Glu Glu Tyr lie Arg Leu 
130 135 140 

Cys Gin Lys Tyr Tyr Ser Ser Glu Pro Gin Ala Val Asp Phe Leu Glu 
145 150 155 160 

Cys Ala Glu Glu Ala Arg Lys Lys lie Asn Ser Trp Val Lys Thr Gin 
165 170 175 

Thr Lys Gly Lys lie Pro Asn Leu Leu Pro Glu Gly Ser Val Asp Gly 
180 185 190 

Asp Thr Arg Met Val Leu Val Asn Ala Val Tyr Phe Lys Gly Lys Trp 
195 200 205 

Lys Thr Pro Phe Glu Lys Lys Leu Asn Gly Leu Tyr Pro Phe Arg Val 
210 215 220 

Asn Ser Ala Gin Arg Thr Pro Val Gin Met Met Tyr Leu Arg Glu Lys 
225 230 235 240 

Leu Asn lie Gly Tyr lie Glu Asp Leu Lys Ala Gin lie Leu Glu Leu 
245 250 255 

Pro Tyr Ala Gly Asp Val Ser Met Phe Leu Leu Leu Pro Asp Glu lie 
260 265 270 

Ala Asp Val Ser Thr Gly Leu Glu Leu Leu Glu Ser Glu lie Thr Tyr 
275 280 285 

Asp Lys Leu Asn Lys Trp Thr Ser Lys Asp Lys Met Ala Glu Asp Glu 
290 295 300 

Val Glu Val Tyr lie Pro Gin Phe Lys Leu Glu Glu His Tyr Glu Leu 
305 310 315 320 



Arg Ser lie Leu Arg Ser Met Gly Met Glu Asp Ala Phe Asn Lys Gly 
325 330 335 

Arg Ala Asn Phe Ser Gly Met Ser Glu Arg Asn Asp Leu Phe Leu Ser 
340 345 ^ 350 

Glu Val Phe His Gin Ala Met Val Asp Val Asn Glu Glu Gly Thr Glu 
355 360 365 

Ala Ala Ala Gly Thr Gly Gly Val Met Thr Gly Arg Thr Gly His Gly 
370 375 380 

Gly Pro Gin Phe Val Ala Asp His Pro Phe Leu Phe Leu lie Met His 
385 390 395 400 

Lys lie Thr Lys Cys lie Leu Phe Phe Gly Arg Phe Cys Ser Pro 
405 410 415 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CGCCCATGGG AAGTCAAGCC TCAAG 25 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
CGCAAGCTTT CACTTCCTTT TATCTCCCTG 3 0 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
CGCGGATCCG CCATCATGGA CACAATCTTC TTG 33 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CGCGGTACCT CACTTCCTTT TATCTCCCTG 30 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
CGCTCTAGAT CAAGCGTAGT CTGGGACGTC GTATGGGTAG GGATTTGTCA CTCTTCC 57 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

NAATATATTT NNATTTAAAC AATACAGAGA AGTCAAAATG GACACAATCT TC TTGTGG AG 60 

TCTTCTATTG CTGTTTTTTC GAAGTCAAGC CTCANGAATG CTCAGCTGCA AAAAAATACC 12 0 

G AATTTGC C A GTGGNATCTT TATCAAGAGG TTTCCTTCAT CTGCATAAGG N 171 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 515 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GGCANNANAA C AATC TNATC CAAGGACTGT GGNACTCCTG TTCCCTGCTC ATCATGTCAT 60 

GGGGCATCTG CCAGGAACCA TCTTTGATGG TGTAAAAATC TTGAATACAT AAGAGGGAAA 120 

TTTT AG AC TT GTTAGAAAGA AGCCAAGCAA TTGAGACCTT AGATAGAACT TAGAATTCTC 180 

GCCGAGTTTT GTTGGGTAAT TGTTACTTCA AAAAAAAATG CAATTTCTGT TCCCTCTTTC 240 

CTCCAACCAT TTATC TGGG A AGCAAGTTAT TGGCAACCCA GAGCTGATTG TTGGAGCCGG 3 00 

GGAAAATGGT GTGAAATGTG AGAAAATGTA ATTGAGATAA TAAAAACAAA AGATTTTACA 3 60 

ATATATTATC CTCTAAGTCA TCCATTAAAA AATTGGTAGC AAAAATGTGC AGTGTTTCAA 42 0 

G AC TTTTC TT TTCTTTTTTT TTNAATACCA GATTAAAGTA GAC C AAAAAG TAGACTCCAA 480 

AGACGTTTGG ATNCTTGAAC ATAACCGNGA TATTA 515 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1370 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 67 1281 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 67.. 120 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 121.. 1281 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GGCACGAGGG AAAACTCTAT TTTGAAAATG AATATATTTT GATTTAAACA ATACAGAGAA 60 
GTCAAA ATG GAC ACA ATC TTC TTG TGG AGT CTT CTA TTG CTG TTT TTT 108 
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Met Asp Thr lie Phe Leu Trp Ser Leu Leu Leu Leu Phe Phe 
-18 -15 -10 -5 



GGA AGT CAA GCC TCA AGA TGC TCA GCT CAA AAA AAT ACC GAA TTT GCA 156 

Gly Ser Gin Ala Ser Arg Cys Ser Ala Gin Lys Asn Thr Glu Phe Ala 

15 10 

GTG GAT CTT TAT CAA GAG GTT TCC TTA TCT CAT AAG GAC AAC ATT ATA 204 

Val Asp Leu Tyr Gin Glu Val Ser Leu Ser His Lys Asp Asn lie lie 
15 20 25 

TTT TCA CCC CTT GGA ATA ACT TTG GTT CTT GAG ATG GTA CAA CTG GGA 252 

Phe Ser Pro Leu Gly lie Thr Leu Val Leu Glu Met Val Gin Leu Gly 
30 35 40 

GCC AAA GGA AAA GCA CAG CAG CAG ATA AGA CAA ACT TTA AAA CAA CAG 3 00 

Ala Lys Gly Lys Ala Gin Gin Gin lie Arg Gin Thr Leu Lys Gin Gin 

45 50 55 60 

GAA ACC TCA GCT GGG GAA GAA TTT TTG GTA CTG AAG TCA TTT TGC TCT 348 

Glu Thr Ser Ala Gly Glu Glu Phe Leu Val Leu Lys Ser Phe Cys Ser 

65 70 75 

GCC ATC TCA GAG AAA AAA CAA GAA TTT ACA TTT AAT CTT GCC AAT GCC 3 96 

Ala lie Ser Glu Lys Lys Gin Glu Phe Thr Phe Asn Leu Ala Asn Ala 
80 85 90 

CTC TAC CTT CAA GAA GGA TTC ACT GTG AAA GAA CAG TAT CTC CAT GGC 444 

Leu Tyr Leu Gin Glu Gly Phe Thr Val Lys Glu Gin Tyr Leu His Gly 
95 100 105 

AAC AAG GAA TTT TTT CAG AGT GCT ATA AAA CTG GTG GAT TTT CAA GAT 492 

Asn Lys Glu Phe Phe Gin Ser Ala lie Lys Leu Val Asp Phe Gin Asp 
110 115 120 

GCA AAG GCT TGT GCA GAG ATG ATA AGT ACC TGG GTA GAA AGA AAA ACA 540 

Ala Lys Ala Cys Ala Glu Met lie Ser Thr Trp Val Glu Arg Lys Thr 

125 130 135 140 

GAT GGA AAA ATT AAA GAC ATG TTT TCA GGG GAA GAA TTT GGC CCT CTG 588 

Asp Gly Lys lie Lys Asp Met Phe Ser Gly Glu Glu Phe Gly Pro Leu 

145 150 155 

ACT CGG CTT GTC CTG GTG AAT GCT ATT TAT TTC AAA GGA GAT TGG AAA 63 6 

Thr Arg Leu Val Leu Val Asn Ala lie Tyr Phe Lys Gly Asp Trp Lys 
160 165 170 

CAG AAA TTC AGA AAA GAG GAC ACA CAG CTG ATA AAT TTT ACT AAG AAA 684 

Gin Lys Phe Arg Lys Glu Asp Thr Gin Leu lie Asn Phe Thr Lys Lys 
175 180 185 

AAT GGT TCA ACT GTC AAA ATT CCA ATG ATG AAG GCT CTT CTG AGA ACA 732 

Asn Gly Ser Thr Val Lys lie Pro Met Met Lys Ala Leu Leu Arg Thr 
190 195 200 

AAA TAT GGT TAT TTT TCT GAA TCT TCC CTG AAC TAC CAA GTT TTA GAA 780 

Lys Tyr Gly Tyr Phe Ser Glu Ser Ser Leu Asn Tyr Gin Val Leu Glu 

205 210 215 220 

TTG TCT TAC AAA GGT GAT GAA TTT AGC TTA ATT ATC ATA CTT CCT GCA 82 8 

Leu Ser Tyr Lys Gly Asp Glu Phe Ser Leu lie lie lie Leu Pro Ala 

225 230 235 
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GAA GGT ATG GAT ATA GAA GAA GTG GAA AAA CTA ATT ACT GCT CAA CAA 87 6 

Glu Gly Met Asp lie Glu Glu Val Glu Lys Leu lie Thr Ala Gin Gin 
240 245 250 



ATC CTA AAA TGG CTC TCT GAG ATG CAA GAA GAG GAA GTA GAA ATA AGC 924 

He Leu Lys Trp Leu Ser Glu Met Gin Glu Glu Glu Val Glu He Ser 
255 260 265 

CTC CCT AGA TTT AAA GTA GAA CAA AAA GTA GAC TTC AAA GAC GTT TTG 972 

Leu Pro Arg Phe Lys Val Glu Gin Lys Val Asp Phe Lys Asp Val Leu 

270 275 280 

TAT TCT TTG AAC ATA ACC GAG ATA TTT AGT GGT GGC TGC GAC CTT TCT 102 0 

Tyr Ser Leu Asn He Thr Glu He Phe Ser Gly Gly Cys Asp Leu Ser 
285 290 295 300 

GGA ATA AC A GAT TCA TCT GAA GTG TAT GTT TCC CAA GTG ACG CAA AAA 1068 

Gly He Thr Asp Ser Ser Glu Val Tyr Val Ser Gin Val Thr Gin Lys 
305 310 315 

GTT TTC TTT GAG ATA AAT GAA GAT GGT AGT GAA GCT GCA AC A TCA ACT 1116 

Val Phe Phe Glu He Asn Glu Asp Gly Ser Glu Ala Ala Thr Ser Thr 
320 325 330 

GGC ATA CAC ATC CCT GTG ATC ATG AGT CTG GCT CAA AGC CAA TTT ATA 1164 

Gly He His He Pro Val He Met Ser Leu Ala Gin Ser Gin Phe He 
335 340 345 

GCA AAT CAT CCA TTT CTG TTT ATT ATG AAG CAT AAT CCA ACA GAA TCA 1212 

Ala Asn His Pro Phe Leu Phe He Met Lys His Asn Pro Thr Glu Ser 

350 355 360 

ATT CTG TTT ATG GGA AGA GTG ACA AAT CCT GAC ACC CAG GAG ATA AAA 12 60 

He Leu Phe Met Gly Arg Val Thr Asn Pro Asp Thr Gin Glu He Lys 
365 370 375 380 

GGA AGA GAT TTA GAT TCA CTG TGAATGAAAA GCACAGCCTC AGAATAAAAG 1311 
Gly Arg Asp Leu Asp Ser Leu 
385 

ATGATTTCTC AAAAATAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 137 0 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 405 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Asp Thr He Phe Leu Trp Ser Leu Leu Leu Leu Phe Phe Gly Ser 
-18 -15 -10 -5 

Gin Ala Ser Arg Cys Ser Ala Gin Lys Asn Thr Glu Phe Ala Val Asp 
15 10 

Leu Tyr Gin Glu Val Ser Leu Ser His Lys Asp Asn He He Phe Ser 
15 20 25 30 
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Pro Leu Gly lie Thr Leu Val Leu Glu Met Val Gin Leu Gly Ala Lys 
35 40 45 

Gly Lys Ala Gin Gin Gin lie Arg Gin Thr Leu Lys Gin Gin Glu Thr 
50 55 60 

Ser Ala Gly Glu Glu Phe Leu Val Leu Lys Ser Phe Cys Ser Ala lie 
65 70 75 

Ser Glu Lys Lys Gin Glu Phe Thr Phe Asn Leu Ala Asn Ala Leu Tyr 
80 85 90 

Leu Gin Glu Gly Phe Thr Val Lys Glu Gin Tyr Leu His Gly Asn Lys 
95 100 105 110 

Glu Phe Phe Gin Ser Ala lie Lys Leu Val Asp Phe Gin Asp Ala Lys 
115 120 125 

Ala Cys Ala Glu Met lie Ser Thr Trp Val Glu Arg Lys Thr Asp Gly 
130 135 140 

Lys lie Lys Asp Met Phe Ser Gly Glu Glu Phe Gly Pro Leu Thr Arg 
145 150 155 

Leu Val Leu Val Asn Ala lie Tyr Phe Lys Gly Asp Trp Lys Gin Lys 
160 165 170 

Phe Arg Lys Glu Asp Thr Gin Leu lie Asn Phe Thr Lys Lys Asn Gly 
175 180 185 190 

Ser Thr Val Lys lie Pro Met Met Lys Ala Leu Leu Arg Thr Lys Tyr 
195 200 205 

Gly Tyr Phe Ser Glu Ser Ser Leu Asn Tyr Gin Val Leu Glu Leu Ser 
210 215 220 

Tyr Lys Gly Asp Glu Phe Ser Leu lie lie lie Leu Pro Ala Glu Gly 
225 230 235 

Met Asp lie Glu Glu Val Glu Lys Leu lie Thr Ala Gin Gin lie Leu 
240 245 250 

Lys Trp Leu Ser Glu Met Gin Glu Glu Glu Val Glu lie Ser Leu Pro 
255 260 265 270 

Arg Phe Lys Val Glu Gin Lys Val Asp Phe Lys Asp Val Leu Tyr Ser 
275 280 285 

Leu Asn lie Thr Glu lie Phe Ser Gly Gly Cys Asp Leu Ser Gly lie 
290 295 300 

Thr Asp Ser Ser Glu Val Tyr Val Ser Gin Val Thr Gin Lys Val Phe 
305 310 315 

Phe Glu lie Asn Glu Asp Gly Ser Glu Ala Ala Thr Ser Thr Gly lie 
320 325 330 

His lie Pro Val lie Met Ser Leu Ala Gin Ser Gin Phe lie Ala Asn 
335 340 345 350 

His Pro Phe Leu Phe lie Met Lys His Asn Pro Thr Glu Ser lie Leu 
355 360 365 
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Phe Met Gly Arg Val Thr Asn Pro Asp Thr Gin Glu lie Lys Gly Arg 
370 375 380 



Asp Leu Asp Ser Leu 
385 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 974 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GGTACCTAAG TGAGTAGGGC GTCCGATCGA CGGACGCCTT TTTTTTGAAT TCGTAATCAT 60 

GGTCATAGCT GTTTCCTGTG TGAAATTGTT ATCCGCTCAC AATTCCACAC AACATACGAG 120 

CCGGAAGCAT AAAGTGTAAA GCCTGGGGTG C C T AATGAGT GAGCTAACTC ACATTAATTG 180 

CGTTGCGCTC ACTGCCCGCT TTCCAGTCGG GAAACCTGTC GTGCCAGCTG CATTAATGAA 240 

TCGGCCAACG CGCGGGGAGA GGC GGTTTGC GTATTGGGCG CTCTTCCGCT TCCTCGCTCA 300 

CTGACTCGCT GCGCTCGGTC GTTCGGCTGC GGCGAGCGGT ATCAGCTCAC TCAAAGGCGG 3 60 

TAAT AC GGTT ATCCACAGAA TCAGGGGATA ACGCAGGAAA GAACATGTGA GCAAAAGGCC 420 

AGCAAAAGGC CAGGAACCGT AAAAAGGCCG CGTTGCTGGC GTTTTTCCAT AGGCTCCGCC 480 

CCCCTGACGA GCATCACAAA AATCGACGCT CAAGTCAGAG GTGGCGAAAC CCGACAGGAC 540 

TATAAAGATA CCAGGCGTTT CCCCCTGGAA GCTCCCTCGT GCGCTCTCCT GTTCCGACCC 600 

TGCCGCTTAC CGGATACCTG TCCGCCTTTC TCCCTTCGGG AAGCGTGGCG CTTTCTCATA 660 

GCTCACGCTG TAGGTATCTC AGTTC GGTGT AGGTCGTTCG CTCCAAGCTG GGCTGTGTGC 720 

ACGAACCCCC CGTTCAGCCC GACCGCTGCG CCTTATCCGG TAACTATCGT CTTGAGTCCA 780 

ACCCGGTAAG ACACGACTTA TCGCCACTGG CAGCAGCCAC TGGTAACAGG ATTAGCAGAG 840 

CGAGGTATGT AGGCGGTGCT ACAGAGTTCT TGAAGTGGTG GCCTAACTAC GGC TAC ACTA 900 

GAAGAACAGT ATTTGGTATC TGCGCTCTGC TGAAGCCAGT TACCTTCGGA AAAAGAGTTG 960 

GTAGCTCTTG ATCCGGCAAA CAAACCACCG CTGGTAGCGG TGGTTTTTTT GTTTGCAAGC 1020 

AGCAGATTAC GCGCAGAAAA AAAGGATCTC AAGAAGATCC TTTGATCTTT TCTACGGGGT 1080 

CTGACGCTCA GTGGAACGAA AACTC AC GTT AAGGGATTTT GGTCATGAGA TTATCGTCGA 1140 

CAATTCGCGC GCGAAGGCGA AGCGGCATGC ATTTACGTTG ACACCATCGA ATGGTGCAAA 1200 

ACCTTTCGCG GTATGGCATG ATAGCGCCCG GAAGAGAGTC AATTCAGGGT GGTGAATGTG 1260 
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AAACCAGTAA 


CGTTATACGA 


TGTCGCAGAG 

X VJ X V* VJV_^TkVj^lVj 


tatppppptp 

X ft X vjVwVwVJVj X vj 


TPTPTTATP A 
XVwXV^l Xrt.1 Vwrt 


PAPPPTTTPP 
VjAVwVwVj 111 VwVw 


lion 
± J z U 


CGCGTGGTGA 


ACCAGGCCAG 


CPAPGTTTPT 

v_» v^-^iv— VJ X X X v> X 


PPPAAAAPPP 

VjVw VJrtrtrtrtVw VjVw 


PPP A A A A BPT 
VjVjVjrtArtrtAVj 1 


ppA A PP'P'PPP 
VjVj AAVjC VjVjVw Vj 


1 1 OA 


ATGGCGGAGC 


TGAATTACAT 

X Vjiiii X X XX Vw- Xi. X 


TCPPAAPPPP 

X Vw V_.fl_fJ.V_ \_ VjV_ 


ptpppap a ap 

VJ X VjvjVwrtvw-rtrtVw 


rtrtVw 1 VjVjVwVjVjVj 


p« A A 7\p Ap^PP 
Vw AAAU AVj 1 VwVj 


1 A A C\ 


TTGCTGATTG 


GCGTTGCCAC 


CTCPAGTPTG 

V* X v_ vriVJ X v_- X Vj 


VjVwVwVw X VjVwrt.V_Vj 


VwVjVwV^Vj 1 L-VjVwrt 


a a r p r rp ,r pp , p , p , p' 

AA 1 1 Vj 1 VwVjVwVj 


IjUU 


GCGATTAAAT 


CTCGCGCCGA 

\w J. V^ VJV_ V- VJJXi 


TPAAPTPPPT 

X V*rtfl\, X VjVjVj X 


nPPAPtPHTPP 
VjVwVwrt.VjVwVj 1 VjVj 


X VjVj 1 Vj 1 V^vjrt 1 


PPTAP7i A PP 1 A 
VjVj 1 AVjAAVwVjA 


1 C C Pt 


AGCGGCGTCG 


AAGPPTGTAA 


appppppptp 

fVVjV_ VjVjVw VjVj X Vj 


PAPA Z\ TPTTP 
VwrtVwrtrt. 1L1 1L 


1 VwVjV-VjV^rtAVwVj 


PPTP A PTiPP P 
UVj 1 Vw AVj 1 VjVjVj 


1620 


CTGATCATTA 


APTATPPGPT 

X X v_V_ \Jv_ X 


PPATPAPP AP 

VjvjJ-i X vj.rt.v_ V_ rt Vj 


Pi A tppp a TTP 

Vjrtl VjVwVwAl 1 Vj 


Vw 1 Vj X VjVjrtAVjlw 


1 VjVwVw 1 VjL AOT 


1680 


AATGTTCCGG 


PfiTTATTTPT 

VwVJ X X *i XXX Vw X 


tp atptptpt 

X Vjrt X Vj 1 Vw 1 Vw X 


p a pp a p a p a 0 

vj/\V_ v_/\vj/\L- AL 


P 1 P* A T>P* A A P 1 A P* 

L.Vw A 1 LAALAb 


TATTATTTTC 


1740 


TCCCATGAAG 


apggtapppp 

*^Vw vjvj x xiV_* VJVw Vj 


aptppppptp 

flv_ X VjVjvjV_ Vj X Vj 


P.APPZ1TPTPP 
Vj.rtVjVw.rt 1 Vw X VjVj 


fpPPP A T"T 1 PPP 
IVwVjVwrtX 1 VjVjVj 


rpp a PP A PP A A 

1 VwAVwCAVjVwAA 


1 O A A 

1800 


ATCGCGCTGT 


TAGPGGGPPP 

X JiVJVy VJVjvjVw Vw V— 


ATTAAPTTPT 

rt X X rtrtVj X X Vw X 


Vj 1 Vw 1 VwvjvjVw vjL 


Vj 1 Vw 1 VjL.Vj 1 Vw I 


VjVjVw 1 VjVjL 1 VjVj 


1 O CA 

1860 


CATAAATATC 


TCACTCGPAA 

X X Vj» VJ V_* 


TP A A ATTP AP 
x v.n/iri x x Vwrtvj 


pppaTapppp 

VwVwVjrt X rtVjVwVjVj 


a ar , r , r , r - ' a app 

rtrtVw Vj Vj VjArtVjVj 


pp a r^T^r^r* a pm 

Vw Vj AVw 1 VjVjAVj 1 


1 Ann 


GCCATGTCCG 


GTTTTCAACA 

vj x x x x v«rmv«n 


AAPPATPPAA 

rt-riVwVw/A X VjVwrtrt 


ATPPTP A 21TP 
rt. X VjVw X Vjrtrt X Vj 


appppa T'p'p'T 1 

AvjVjVjVwA 1 VwVj 1 


mpp p a P T»pp p 

1 LtLAL 1 VjVwVj 


1980 


ATGC TGGTTG 


CCAACGATCA 


G ATGGPPP TP 

vjrt x vjvJVwVjVw . X Vj 


PPPPPA ATPP 
VjVjVwVJV_rtrt. X VjV_ 


VjVwVjVwlwrtk x 1 rtVw 


p* r* a c* t> P 1 p* p* P 1 P 1 

VwVjAVj 1 VwL, VjVjVj 


O A A Pt 


CTGCGCGTTG 


GTGCGGATAT 


PTPPPTAPTP 

Vw X Vw VjVj X xTkVj X Vj 


CP AT APP APP 
VjVjrt. X rtVw VjrtVw Vj 


ATAPPPA ApA 
rt, 1 rt,Vw Vw Vjrt-rt.Vjrt 


P A PpfPP A T>PT> 
L. AVjVw 1 LA 1 Vj 1 


0 1 Pt Pi 
z 1UU 


TATATCCCGC 


CGTTAAPPAP 

Vw VJ X X XXXi\w v_» nv^ 


PATPAAAP AH 

V_ /i. X v^rtjrtrt,Vwrt,Vj 


PATTTTPPPP 
Vjrt X X X X Vw VjV_ V_ 


x VjVw 1 VjVjVjVjVwrt. 


A A PP A P 1 ^P i~*tTi/~* 
AAVwVw AVjLVj 1 Vj 


A 1 /-A 

z JLoU 


GACCGCTTGC 


TGPAAPTPTP 

X VJV^iUlV,* X V— . X v_ 


tp a pppp p a p 

X V_rtvJvjVjv_ \_rtVj 


PPPPTPZi Apf 
VjVwVjvj x Vjrtrt VjVj 


PPA A TP APPT 1 

VjVwilrt X VwrtVjVw 1 


Vjl 1 VJVwVwLVjTL 


2220 


TC AC TGGTG A 


AAAGAAAAAP 


V_rtVwVwV_ X VjVjVwVj 


PPP a ATS PPP 
Vw Vw Vw AA 1 AVw Vj Vw 


rtArtVwVwVjVwVw 1 Vw 


1 VwVwLVwVjLVjCG 


*"> 0 0 p» 
22 80 


TTGGCCGATT 


P ATT A ATPP A 

^-A X x ftrt X VjVwrt 


pptpp.p arra 

VjVw 1 VjVjV_.rtVwVj.rt. 


C AP 1 P ,r P r P r rr , PT 1 
VwAVjVjI 1 IVwV^L. 


P* A P'T'P'P* A A A P 1 
VjAVw 1 VjVjrtAAVj 


Cw VjCjCjC AGTG A 


2340 


GCGCAACGPA 


ATT A ATPT A A 

rt X X nn X Vj X rtrt 


vj 1 lrtVjV_VjVwVjrt 


Al IVjIVwVjAUIw 


A A A P'P'P'P'P'P' A 
AAAVjC VjCjVw Vw A 


m P^ p m p /~t /~i m /~» 

TCGTGCCTCC 


2400 


CCACTCCTGC 


agttpppppp 

^i\J X X Vw VJOVJVJVJ 


PATPPATPPP 

Vwrt X VjVjrt X VjVwVj 


Vw VjVj/\ X AVjVw Vw Vj 


P* T'P' P 1 T*P* P 1 Ti TlTi 
lw X VjVw 1 VjVj 111 


Vw Vw Tbb ATGC C 


2460 


GACGGATTTG 

\j**v vjvjr* xxx vj 


PAPTPPPPPT 


AHA APTPPPP 

rt Vjrtrt,V_ l VjVw 


VjAVjVj ILbl L.V^ 


a p ,1 p ,, p"t'p , a ppp 
AVjVwVw 1 UAVjVjVw 


A PP 7\ PPmP A 7\ 

AvjC AvjCw TGAA 


2520 


CCAACTCGCG 


AGGGGATPGA 


PPPPPPPPTP 
VJVw V_ Vw VjVjVjVj X VJ 


VjVjVw VjA/\Vj/\/\Vw 


rpp PAPPA T"P a 

1 VwVwAvjVw A 1 VjA 


Vj A 1 Vw L C Vw VjL Vj 


2580 


CTGGAGGATC 


ATCCAGCCGG 


CGTPPPGGAA 

VwVJ X VwVwVw VjVJrtrt 


A APP ATTPPP 
/xrVVw \JJr\ X X V^ V^ Vj 


A APPPPA App 
rtrt VjVw Vw Vw rtA Vw Vw 


mmmp A T" A P A A 

1 1 ILAIAVjAA 


O £T /I Pi 

2 b4 U 


GGCGGCGGTG 


G AATP G AAAT 


ptpptpatpp 

v_ X VwVJ X \J-ii. X VjVj 


PAPPTTPPPP 
Vw-rt.VjVj X X VjVjVjVw 


PTPPPTTPPT 1 
Vj 1 VwVjVw 1 1 VjVj 1 


LvjVjILAI 1 1L 


ot nn 
2 /UU 


GAACCCCAGA 


GTCCCGPTPA 


paapaaptpp 

vjfl_rt.vjf\_fi.V_ X Vw Vj 


TP A APA SPPP 
X Vwrtrt.Vjrtrt.Vj VjVw 


PSTAPA Appp 
VjA 1 AVjAAVjVjVw 


P A T<PP PP rri/ - ^ p 
VjA 1 VjL VjL 1 LL 


O *7 /T A 

2 7 60 


GAATCGGGAG 


CGGCG AT APP 


GTAAAPPAPP 

vj x r^rvf-iVjVwr\VwVj 


APP A APPPPIT 1 
rt.VjVjrtrt.VjVw VjVj 1 


PZlPPPPATTP 

VwrtVjVwlwVwAl I Vw 


VjL L VjL L AALL 


O O O Pi 
282 U 


TCTTCAGCAA 


TATCACGGGT 


AGCCAACGCT 


ATGTCC TG AT 


AGCGGTCCGC 


CACACCCAGC 


2880 


CGGCCACAGT 


CGATGAATCC 


AGAAAAGCGG 


CCATTTTCCA 


CCATGATATT 


CGGCAAGCAG 


2940 


GCATCGCCAT 


GGGTCACGAC 


GAGATCCTCG 


CCGTCGGGCA 


TGCGCGCCTT 


GAGCCTGGCG 


3000 


AACAGTTCGG 


CTGGCGCGAG 


CCCCTGATGC 


TCTTCGTCCA 


GATCATCCTG 


ATCGACAAGA 


3060 


CCGGCTTCCA 


TCCGAGTACG 


TGCTCGCTCG 


ATGC GATGTT 


TCGCTTGGTG 


GTCGAATGGG 


3120 
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CAGGTAGCCG 


GATCAAGCGT 


ATGCAGCCGC 


CGCATTGCAT 


CAGCCATGAT 


GG AT AC TTTC 


3180 


TCGGCAGGAG 


CAAGGTGAGA 


TGACAGGAGA 


TCCTGCCCCG 


GCACTTCGCC 


CAATAGCAGC 


3240 


CAGTCCCTTC 


CCGCTTCAGT 


GACAACGTCG 


AGCACAGCTG 


CGCAAGGAAC 


GCCCGTCGTG 


3300 


GCCAGCCACG 


ATAGCCGCGC 


TGCCTCGTCC 


TGCAGTTCAT 


TCAGGGCACC 


GGACAGGTCG 


3360 


GTCTTGACAA 


AAAGAACCGG 


GCGCCCCTGC 


GCTGACAGCC 


GGAAC AC GGC 


GGCATCAGAG 


3420 


CAGCCGATTG 


TC TGTTGTGC 


CCAGTCATAG 


CCGAATAGCC 


TCTCCACCCA 


AGCGGCCGGA 


3480 


GAACCTGCGT 


GCAATCCATC 


TTGTTCAATC 


ATGCGAAACG 


ATCCTCATCC 


TOTCTCTTHA 


3540 


TCAGATCTTG 


ATCCCCTGCG 


CCATCAGATC 


CTTGGCGGCA 


AG AAAGC CAT 


CCAGTTTACT 


3 600 


TTGCAGGGCT 


TCCCAACCTT 


AC C AG AGGGC 


GCCCCAGCTG 


GCAATTCCGG 


TTC GC TTGCT 

x x vjv x x vjv> x 


3660 


GTCCATAAAA 


CCGCCCAGTC 


TAGCTATCGC 


CATGTAAGCC 


C AC TGCAAGC 


TACCTCIPTTT 

X X XXX 


**7? 0 

■J 1 £* \j 


CTCTTTGCGC 


TTGCGTTTTC 


CCTTGTCCAG 


ATAGCCCAGT 


AGCTGACATT 


CATCCGGGGT 


3780 


CAGCACCGTT 


TCTGCGGACT 


GGCTTTCTAC 


GTGTTCCGCT 


TCCTTTAGCA 


GCCCTTGCGC 


3840 


CCTGAGTGCT 


TGCGGCAGCG 


TGAAGCTTAA 


AAAACTGCAA 


AAAATAGTTT 


GACTTGTGAG 


3900 


CGGATAACAA 


TTAAGATGTA 


CCCAATTGTG 


AGCGGATAAC 


AATTTCACAC 


ATTAAAGAGG 


3960 


AGAAATTACA 


TATG 










3974 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
AAGCTTAAAA AACTGCAAAA AATAGTTTGA CTTGTGAGCG GATAACAATT AAGATGTACC 60 
CAATTGTGAG CGGATAACAA TTTC AC AC AT TAAAGAGGAG AAATTACATA TG 112 
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